Solution of the linear quadratic regulator problem of black box linear systems using reinforcement learning

نویسندگان

چکیده

In this paper, a Q-learning algorithm is proposed to solve the linear quadratic regulator problem of black box systems. The only has access input and output measurements. A Luenberger observer parametrization constructed using control new obtained from factorization utility function. An integral reinforcement learning approach used develop approximator structure. gradient descent update rule estimate on-line parameters Q-function. Stability convergence under assessed Lyapunov stability theory. Simulation studies are carried out verify approach.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Linear Quadratic Regulation using Reinforcement

In this paper we describe a possible way to make reinforcement learning more applicable in the context of industrial manufacturing processes. We achieve this by formulating the optimization task in the linear quadratic regulation framework, for which a conventional control theoretic solution exist. By rewriting the Q-learning approach into a linear least squares approximation problem, we can ma...

متن کامل

Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems

We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control schemes. More recently, for the average cost LQ problem, a regret bound of O( √ T ) was shown, apart form logarithmic factors. However, this bound scales exponentially with p, the dimension of the ...

متن کامل

Numerical solution of linear control systems using interpolation scaling functions

The current paper proposes a technique for the numerical solution of linear control systems.The method is based on Galerkin method, which uses the interpolating scaling functions. For a highly accurate connection between functions and their derivatives, an operational matrix for the derivatives is established to reduce the problem to a set of algebraic equations. Several test problems are given...

متن کامل

Reinforcement Learning Applied to Linear Quadratic Regulation

Recent research on reinforcement learning has focused on algorithms based on the principles of Dynamic Programming (DP). One of the most promising areas of application for these algorithms is the control of dynamical systems, and some impressive results have been achieved. However, there are significant gaps between practice and theory. In particular, there are no con vergence proofs for proble...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Information Sciences

سال: 2022

ISSN: ['0020-0255', '1872-6291']

DOI: https://doi.org/10.1016/j.ins.2022.03.004